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Methods for Producing L-Amino Acids 
Cross-Reference to Related Applications 

This application is a nonprovisional of U.S. Application No. 60/145,217, 
filed July 23, 1999 and U.S. Application No. 60/150,017, filed August 20, 1999, 
both of which contents are incorporated herein by reference. 

Background of the Invention 

Field of the Invention 

The present invention relates, in general, to a method of producing L- 
amino acids and to a gene encoding phosphoglucoisomerase. 

Background Information 

Bacterial cells are used industrially to produce amino acids by 
fermentation processes (Ishino, S. et al, J. Gen. Appl Microbiol 57:157-165 
(1991), Kinoshita, S., Nakayama, K. and Nagasaki, S., J. Gen. Appl. Microbiol 
4:128-129 (1958)). Although numerous research reports and reviews have 
appeared concerning fermentation processes and the mechanisms of accumulation 
of amino acids, more progress needs to be made to increase the yields of amino 
acids from microorganisms (Ishino, S.etal,J. Gen. Appl Microbiol 37: 1 57-165 
(1991), Aida, K. et al, eds., "Biotechnology of Amino Acid Production," 
Kodansha (Tokyo)/Elsevier (New York) (1986) and Marx, A. et al, Metabolic 
Engineering 7:35-48 (1999)). 

There has been some success in using metabolic engineering to direct the 
flux of glucose derived carbons toward aromatic amino acid formation (Flores, 
N. et al, Nature Biotechnol 74:620-623 (1996)). However, the successful 
application in producer strains has not yet been documented (Berry, A., TIBTECH 
74:250-256(1996)). 

Metabolic engineering relates to manipulation of the flow of carbons of 
starting materials, such as carbohydrates and organic acids, through the variety 



of metabolic pathways during fermentation. Studies have been done, for 
example, on the central metabolism of Corynebacterium glutamicum using 13 C 
NMR studies (Ishino, S. et al , J. Gen. Appl. Microbiol 57:157-165(1991), Marx, 
A. et al, Biotechnology and Bioengineering 49:\\ 1-129 (1996)). Additionally, 
also using 13 C NMR, Walker et al (Walker, T. et al,J. Biol Chem. 257:1 189- 
1195 (1982)) analyzed glutamic acid fermentation by Microbacterium 
ammoniaphilum, and Inbar et al (Inbar, L. et al, Eur. J. Biochem, 149:601-601 
(1985)) studied lysine fermentation by Brevibacterium flavum. 

The present invention solves a problem of improving yields of amino 
acids during fermentation using metabolic engineering. 

Summary of the Invention 

The present invention provides a method of producing L-amino acids by 
culturing altered bacterial cells having increased amounts of NADPH as 
compared to unaltered bacterial cells, whereby L-amino acid yields from said 
altered bacterial cells are greater than yields from unaltered bacterial cells. 

The present invention also provides a method of producing a bacterial cell 
with a mutated phosphoglucose isomerase (pgi) gene comprising (a) subcloning 
an internal region of the pgi gene into a suicide vector; and (b) inserting said 
suicide vector into a bacterial genome, via homologous recombination, whereby 
a bacterial cell with an altered pgi gene is produced. The invention further 
provides an altered bacterial cell produced according to this method. 

The invention also provides a vector useful according to this method. 

The present invention further provides isolated nucleic acid molecules 
comprising a polynucleotide encoding the Corynebacterium glutamicum 
phosphoglucose isomerase polypeptide having the amino acid sequence shown 
in Figure 1 (SEQ ID NO: 2) or one of the amino acid sequence encoded by the 
DNA clone deposited in a bacterial host as NRRL Deposit Number B-30174 on 
August 17, 1999. 

The present invention also relates to recombinant vectors, which include 
the isolated nucleic acid molecules of the present invention, and to host cells 



containing the recombinant vectors, as well as to methods of making such vectors 
and host cells and for using them for production of Pgi polypeptides or peptides 
by recombinant techniques. 

The invention further provides an isolated Pgi peptide having an amino 
acid sequence encoded by a polynucleotide described herein. 

Further advantages of the present invention will be clear from the 
description that follows. 

Brief Description of the Figures 

Figure 1A-1C shows the nucleotide (SEQ ID NO: 1) and deduced amino 
acid (SEQ ID NO:2) sequences of pgi. The Pgi peptide has a deduced molecular 
weight of about 59 KDa. 

Detailed Description of the Invention 

It has been determined herein that increased amounts of NADPH in a 
bacterial cell increase product yield, specifically in anabolic processes where 
NADPH is a limiting factor. A way of carrying chemical energy from reactions 
of catabolism to the energy-requiring reactions of biosynthesis, such as the 
formation of amino acids, is in the form of hydrogen atoms or electrons. To be 
effective as reducing agents, hydrogen atoms must have considerable free energy. 
Such high-energy hydrogen atoms are obtained from cell fuels by 
dehydrogenases, which catalyze removal of hydrogen atoms from fuel molecules 
and their transfer to specific coenzymes, particularly to the oxidized form of 
nicotinamide adenine dinucleotide phosphate (NADP + ). The reduced, or 
hydrogen-carrying, form of this coenzyme, designated NADPH, is a carrier of 
energy-rich electrons from catabolic reactions to electron-requiring bio synthetic 
reactions. 

The present invention provides a method for producing L-amino acids by 
culturing altered bacterial cells having increased amounts of NADPH as 



compared to unaltered bacterial cells whereby L-amino acid yields from said 
altered bacterial cells are greater than yields from unaltered bacterial cells. 
Preferred amino acids are L-ly sine, L -threonine and L-isoleucine. As used herein, 
an altered bacterial cell is defined as a bacterial cell which has increased amount 
5 of NADPH as compared to an unaltered bacterial cell. 

In one preferred embodiment, an "altered" bacterial cell is a "mutated" 
bacterial cell. A "mutation" is any detectable change in the genetic material 
which can be transmitted to daughter cells. A mutation can be any (or a 
combination of) detectable, unnatural change affecting the chemical or physical 
1 0 constitution, mutability, replication, phenotypic function, or recombination of one 

or more deoxyribonucleotides; nucleotides can be added, deleted, substituted for, 
inverted, or transposed to new positions with and without inversion. Mutations 
q can occur spontaneously and can be induced experimentally by application of 

JX mutagens or recombinant DNA technology. A mutant variation of a nucleic acid 

j^l 5 molecule results from a mutation. A mutant polypeptide can result from a mutant 

41 nucleic acid molecule. 

jJJ Additionally, an altered or mutated bacterial cell can be genetically 

;L "mutated" to yield an increased amount of NADPH as compared to the genetically 

Sj "unmutated" cell. 

til 

r[20 An increased amount of NADPH in the altered bacterial cell results in 

increased production of amino acids. Preferably, in an altered bacterial cell, 
amino acid yields are increased over yields from the unaltered cell from greater 
than about 1%, and preferably from about 1% to about 100%, preferably from 
about 2% to about 80%, and more preferably, from about 5% to about 60%, and 
25 even more preferably from about 10% to about 80%. As used herein, "yield" is 

defined as grams of amino acid produced, multiplied by 100, divided by grams 
of glucose consumed. 

In agreement with the present invention, the altered bacterial cell of the 
present invention is cultured in a culture medium that comprises a carbon source 
30 and a nitrogen source. The carbon source can be, for example, arabinose, 

cellobiose, fructose, glucose, lactose, maltose, mannose, rhamnose, raffinose, 
sorbose, sucrose, trehalose, pyruvate, or succinate. The carbon source is 
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preferably at an initial concentration of 0.1 to 10%, preferably 0.5 to 6.0% by 
weight. All of the carbon source can be added to the medium before the start of 
culturing, or it can be added step by step or continuously during culturing. 

The medium used herein can be solid or liquid, synthetic (/. e. man-made) 
5 or natural, and contains sufficient nutrients for the cultivation of the altered 

bacterial cell of the present invention. Preferably, the medium employed is a 
liquid medium, more preferably a synthetic liquid medium. 

The natural or synthetic culture media used in the above and below 
described embodiments of the invention also contain a nitrogen source, suitable 

1 0 inorganic salts, and, as appropriate, various trace nutrients, growth factors and the 

like suitable for cultivation of the altered bacterial cell, and can also contain at 
least one supplementary carbon source. The amount of each of these additional 
ingredients to be employed is preferably selected to maximize amino acid 
production. Such amounts can be determined empirically by one skilled in the art 

1 5 according to the various methods and techniques known in the art. 

Illustrative examples of suitable supplemental carbon sources include, but 
are not limited to: other carbohydrates, such as glucose, fructose, sucrose, starch 
or starch hydrolysate, cellulose hydrolysate and molasses; organic acids, such as 
acetic acid, propionic acid, lactic acid, formic acid, malic acid, citric acid, and 

20 fumaric acid; and alcohols, such as glycerol, inositol, mannitol and sorbitol. 

Illustrative examples of suitable nitrogen sources include, but are not 
limited to: ammonia, including ammonia gas and aqueous ammonia; ammonium 
salts of inorganic or organic acids, such as ammonium chloride, ammonium 
nitrate, ammonium phosphate, ammonium sulfate and ammonium acetate; urea; 

25 nitrate or nitrite salts, and other nitrogen-containing materials, including amino 

acids as either pure or crude preparations, meat extract, peptone, fish meal, fish 
hydrolysate, corn steep liquor, casein hydrolysate, soybean cake hydrolysate, yeast 
extract, dried yeast, ethanol-yeast distillate, soybean flour, cottonseed meal, and 
the like. 

30 Illustrative examples of suitable inorganic salts include, but are not 

limited to: salts of potassium, calcium, sodium, magnesium, manganese, iron, 
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cobalt, zinc, copper, molybdenum, tungsten and other trace elements, and 
phosphoric acid. 

Illustrative examples of appropriate trace nutrients, growth factors, and the 
like include, but are not limited to: coenzyme A, pantothenic acid, pyridoxine- 
HC1, biotin, thiamine, riboflavin, flavine mononucleotide, flavine adenine 
dinucleotide, DL-6,8-thioctic acid, folic acid, Vitamin B 12 , other vitamins, bases 
such as adenine, uracil, guanine, thymine and cytosine, L amino acids, sodium 
thiosulfate, p- or r-aminobenzoic acid, niacinamide, nitriloacetate, and the like, 
either as pure or partially purified chemical compounds or as present in natural 
materials. Cultivation of the inventive microorganism strain can be accomplished 
using any of the submerged fermentation techniques known to those skilled in the 
art, such as airlift, traditional sparged -agitated designs, or in shaking culture. 

The culture conditions employed, including temperature, pH, aeration rate, 
agitation rate, culture duration, and the like, can be determined empirically by one 
skilled in the art to maximize amino acid production. The selection of specific 
culture conditions depends upon factors such as medium composition and type, 
culture technique, and similar considerations. 

After cultivation for a sufficient period of time, until one or more kinds 
of amino acids that have accumulated in the cells and/or culture broth can be 
isolated according to any of the known methods including ion exchange 
chromatography, gel filtration, solvent extraction, affinity chromatography, or any 
combination thereof. Any method that is suitable with the conditions employed 
for cultivation can be used. 

Preferred bacterial cells are Cory ne bacterial species and Escherichia coli. 
Preferred among bacterial cells are Coryne bacterium glutamicum cells. As used 
herein, Brevibacterium flavum and Brevibacterium lactofermentum are 
synonymous with Coryne bacterium glutamicum. 

In the present invention, in general, increased NADPH within a 
microorganism is achieved by altering the carbon flux distribution between the 
glycolytic and pentose phosphate pathways of that organism. As used herein, 
"carbon flux" refers to the number of glucose molecules which proceed down a 
particular metabolic path relative to competing paths. 



Preferably, NADPH availability is increased by increasing the carbon flux 
through the oxidative branch of the pentose phosphate pathway. Theoretically, 
1 2 NADPH 's are generated per glucose when glucose is exclusively metabolized 
in the pentose phosphate pathway, but only two NADPH' s are produced per 
5 glucose metabolized in the TCA cycle (tricarboxylic acid, also called the citric 

acid cycle). Ishino, S. et al. , J, Gen. Appl. Microbiol 57:157-165 (1991). The 
present invention provides a method of producing L-amino acids by culturing an 
altered bacterial cell which has an increase in the carbon flux through the pentose 
phosphate pathway. 

1 0 Most of the glucose catabolized in living organisms proceeds through 

glycolysis resulting in the formation of pyruvate. The pentose phosphate 
pathway, also called the hexose monophosphate shunt, is an alternative route for 
glucose catabolism. The pentose phosphate pathway produces NADPH and 
under lysine fermentation conditions is more active. Ishino, S. et aL, J. Gen. 

15 Appl. Microbiol. 57:157-165 (1991). 

In the present invention, an altered bacterial cell can be one in which 
carbon flux though the oxidative branch of the pentose phosphate pathway is 
increased. Specifically, in the present invention, an altered bacterial cell can be 
one which has an increased amount of one or more enzymes involved in the 

20 pentose phosphate pathway. Such pentose phosphate enzymes are selected from 

the group comprising glucose 6-phosphate dehydrogenase, transketolase, 
transaldolase, ribulose 5-phosphate-3-epimerase, ribulose 5-phosphate isomerase 
and 6-phosphogluconate dehydrogenase, and 6-phosphogluconolactonase. 

In a preferred embodiment, the present invention further provides a 

25 method of producing L-amino acids by culturing an altered bacterial cell with an 

increased amount of malic enzyme relative to an unaltered cell. Malic enzyme 
catalyzes the reaction of malate withNADP + to produce pyruvate, carbon dioxide, 
NADPH and H + . 

In a preferred embodiment, the present invention further provides a 
30 method of producing L-amino acids by culturing an altered bacterial cell with an 

increased amount of isocitrate dehydrogenase relative to an unaltered cell. 
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Isocitrate dehydrogenase catalyzes the reaction of isocitrate with NADP + to 
produce oc-ketoglutarate, carbon dioxide, NADPH and H + . 

Both glycolysis and the pentose phosphate pathway compete for glucose. 
In the present invention, an altered bacterial cell can be one in which a decrease 
or blockage of the carbon flux though glycolysis results in an increase in the 
carbon flux though the oxidative branch of the pentose phosphate pathway. As 
used in the present invention, an altered bacterial cell can be one in which a 
decrease in carbon flux through glycolysis is achieved through decreasing the 
amount of one or more enzyme(s) involved in glycolysis. Preferred enzymes are 
6-phosphoglucose isomerase, fructose diphosphate aldolase, D-glyceraldehyde 
phosphate dehydrogenase, phophoglycerate kinase, phosphoglycerate mutase, 
endolase or pyruvate kinase. A preferred enzyme is 6-phosphoglucose isomerase. 

A preferred method of decreasing the amount of a glycolytic enzyme in 
an altered bacterial cell is by mutating the gene which encodes the enzyme. As 
used herein, preferred is blocking (null) or weakening (decreased) expression of 
the gene encoding 6-phosphoglucose isomerase ("Pg*")- 

A preferred method of blocking (null) or weakening (decreased) 
expression of genes encoding enzymes involved in glycolysis is by using suicide 
vectors (also called integrative vectors). As used herein, a suicide vector is 
defined as a vector which does not replicate autonomously within a particular 
organism, which then is introduced into the cell and recombines into a 
homologous region of the organism's chromosome to cause insertional 
inactivation of the homologous gene. Insertional inactivation of the gene is 
achieved by disrupting the reading frame of the gene. Insertional inactivation of 
the gene occurs only if an internal portion of the gene is used as the homologous 
region. 

Recombinant constructs can be introduced into the bacterial cells of the 
present invention using well known techniques such as transduction, transfection, 
transvection, conjugation, electroporation, electrotransformation, calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic lipid- 
mediated transfection, and transformation or other methods. Such methods are 
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described in many standard laboratory manuals, such as Davis et aL, "Basic 
Methods in Molecular Biology," (1986). 

In a preferred embodiment, the altered bacterial cell is produced by (a) 
subcloning an internal region of the pgi gene into a suicide vector; and (b) 
inserting said suicide vector into a bacterial genome via homologous 
recombination. An internal region can be defined as a contiguous DN A sequence 
between but not including the initiation codon and final codon of the open reading 
frame (ORF) in question. Preferably an internal region is chosen which will 
facilitate genomic integration and result in the expression of a non-functional 
polypeptide from the ORF in question. 

In certain preferred embodiments, the suicide vectors can be inducible, 
mutant-specific and/or condition-specific. Particularly preferred among such 
vectors are those inducible by environmental factors that are easy to manipulate, 
such as temperature and nutrient additives. Other suitable environmental factors 
will be readily apparent to the skilled artisan. 

The altered bacterial cells of the present invention can be transformed 
with suicide vectors which optionally include at least one marker gene. Such 
markers include amikacin, augmentin (amoxicillin plus clavulonic acid), 
ampicillin, cefazolin, cefoxitin, ceftazidime, ceftiofur, cephalothin, 
chloramphenicol, enrofloxacin, erythromycin, florfenicol, gentamicin, imipenem, 
kanamycin, sarafloxicin, tetracycline, ticarcillin, streptomycin, spectinomycin, 
hygromycin, trimethoprim or tilmicosin resistance genes. Preferred markers 
include chloramphenicol and/or kanamycin resistance genes. Other suitable 
markers will be readily apparent to the skilled artisan. 

An illustrative example of the use of suicide vectors is as follows: an 
internal region of a gene is amplified via the polymerase chain reaction, and the 
fragment resulting from the amplification is subcloned into a suicide vector which 
includes an antibiotic resistance marker gene, and the suicide vector is 
transformed into the original organism. The recovery of antibiotic resistant 
clones implies insertional inactivation of the homologous gene. The suicide 
vector used can include any plasmid incapable of autonomous replication in the 
target organism. In cases where the target organism is not Escherichia coli, Col 



El based replicons are preferred. Among Col El based replicons pBGS131 
(American Type Culture Collection (ATCC), Manassas, VA, Deposit No. 37443) 
is preferred. 

In a preferred embodiment, the present invention further provides a 
5 method of producing a bacterial cell with a mutated pgi gene. In a particularly 

preferred embodiment, the invention provides a method of producing a bacterial 
cell with a mutated pgi gene comprising (a) subcloning an internal region of the 
pgi gene into a suicide vector; and (b) inserting said suicide vector into a bacterial 
genome via homologous recombination whereby a bacterial cell with an altered 
10 pgi gene is produced. 

In a further embodiment, the present invention provides a bacterial cell 
produced according to the above-described methods. 
i==i An illustrative example of production of an altered bacterial cell follows. 

:|j A region of the Coryne bacterium glutamicum (C glutamicum) pgi gene which 

ilJi 5 encodes 6-phosphoglucose isomerase (a glycolytic enzyme), is amplified by PCR 

> using suitable primers. Preferably, the PCR primers are those listed in SEQ ID 

% NO:3 and SEQ ID NO:4, which contain the recognition sequence for the 

restriction enzyme Hind III. Following restriction with Hind III, the PCR product, 
Si is then subcloned into the suicide vector pBGS131. The resulting subclone is 

P;20 designated pDPTpgi2. The subclone pDPTpgi2 is then transformed into C. 

O glutamicum and kanamycin resistant colonies are selected for on appropriate 

media. The isolation of kanamycin resistant colonies implies that an integration 
event has occurred. Predominantly the integration occurs via homologous 
recombination resulting in disruption of the pgi gene. 

25 Another preferred method of producing an altered bacterial cell is by 

blocking or weakening expression of the appropriate gene through alteration of 
the promoter in front of the gene. Preferred is by using a different promoter from 
any source or changing the nucleotide sequence of the native promoter. Preferred 
among methods of changing the nucleotide sequence of the native promoter is 

30 PCR mutagenesis. Among known bacterial promoters suitable for this use in the 

present invention include the E. coli lac\ and lacL promoters, the T3 and T7 
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promoters, the gpt promoter, the lambda P R and P L promoters, the trp promoter, 
the tac promoter or promoters endogenous to the bacterial cells of the present 
invention. Also preferred is upregulation of genes encoding enzymes involved 
in the pentose phosphate pathway. This can be done by alteration of the promoter 
5 controlling the gene such that a stronger promoter than the native promoter is 

used. Another preferred way of upregulating the genes of the pentose phosphate 
pathway would be increasing the copy number of the genes in question through 
the use of genomic integration or autonomously replicating plasmids. 

In a preferred embodiment, the present invention also provides a method 
10 of producing L-amino acids comprising culturing an altered bacterial cell, 

wherein said bacterial cell is a Coryne bacterium glutamicum cell with a gene 
selected from the group consisting of a mutant pgi gene. 
q Another preferred method of producing an altered bacterial cell compri ses 

]~{ mutating a gene which encodes an enzyme involved in glycolysis to produce 

fill 5 blocked or weakened expression of the gene encoding the glycolytic enzyme. 

J= Illustrative examples of suitable methods for preparing mutated genes include, but 

^ are not limited to: PCR mutagenesis, in vitro chemical mutagenesis, 

^ oligonucleotide mutagenesis, mutagenesis by irradiation with ultraviolet light or 

SJ X-rays, or by treatment with a chemical mutagen such as nitrosoguanidine 

1"20 (N-methyl-N'-nitro-N- nitrosoguanidine), methylmethanesulfonate, nitrogen 

y mustard and the like; gene integration techniques, such as those mediated by 

insertional elements or transposons or by homologous recombination of 
transforming linear or circular DNA molecules; and transduction mediated by 
bacteriophages such as PI. These methods are well known in the art and are 
25 described, for example, in J.H. Miller, Experiments in Molecular Genetics, Cold 

Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1972); J.H. 
Miller, A Short Course in Bacterial Genetics, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, New York (1992); M. Singer and P. Berg, Genes & 
Genomes, University Science Books, Mill Valley, California (1991); J. 
30 Sambrook, E.F. Fritsch and T. Maniatis, Molecular Cloning: A Laboratory 

Manual, 2d ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York ( 1989); P.B. Kaufman et al , Handbook of Molecular and Cellular Methods 
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in Biology and Medicine, CRC Press, Boca Raton, Florida (1995); Methods in 
Plant Molecular Biology and Biotechnology, B.R. Glick and J.E. Thompson, eds., 
CRC Press, Boca Raton, Florida (1993); and P.F. Smith-Keary, Molecular 
Genetics of Escherichia coli, The Guilford Press, New York, NY (1989). 
5 In a preferred embodiment, the present invention further provides an 

isolated or purified bacterial cell comprising a mutated pgi gene. 

The present invention further provides isolated nucleic acid molecules 
comprising a polynucleotide encoding a Pgi polypeptide having the amino acid 
sequence shown in Figure 1 (SEQ ID NO:2). The nucleotide sequence shown in 
10 Figure 1 (SEQ ID NO:l) can be obtained by sequencing the DNA clone, which 

was deposited on August 17, 1999 at the Agricultural Research Service Culture 
Collection (NRRL) under the terms of the Budapest Treaty, 1815 North 

o University Street, Peoria, Illinois 61604, USA and given accession number B- 

30 1 74. The deposited clone is in the p4 1 - 1 3(C0 1 ) plasmid. 

llil 5 The present invention provides an isolated nucleic acid molecule selected 

: |: from the group consisting of: 

gj (a) a polynucleotide encoding a polypeptide comprising amino acids 

L from about 1 to about 540 in SEQ ID NO:2; 

Sj (b) a polynucleotide encoding a polypeptide comprising one of the 

j^20 amino acid sequences encoded by the DNA clone contained in 

jp( NRRL Deposit No. B-30174; 

(c) the complement of (a) or (b); 

(d) a polynucleotide variant created by altering the polynucleotide of 
(a), wherein: 

25 (1) said altering includes a nucleotide insertion, 

deletion, or substitution, or any combination 
thereof; and 

(2) the number of alterations is equal to or less than 
5 % of the total number of nucleotides present in 
30 (a); 
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(e) a polynucleotide variant created by altering the polynucleotide of 
(b), wherein: 

(1) said altering includes a nucleotide insertion, 
deletion, or substitution, or any combination 

5 thereof; and 

(2) the number of alterations is equal to or less than 5 
% of the total number of nucleotides present in 
(b); 

(f) a polynucleotide variant created by altering the polynucleotide of 



10 (c), wherein: 

(1) said altering includes a nucleotide insertion, 
deletion, or substitution, or any combination 
thereof; and 

|j! (2) the number of alterations is equal to or less than 5 

1 5 % of the total number of nucleotides present in (c). 



% The present invention further provides the above nucleic acid molecule 

^1 wherein said polynucleotide has the complete nucleotide sequence in SEQ ID 

CS NO:l. 

The present invention further provides the above nucleic acid molecule 
^20 wherein said polynucleotide has the nucleotide sequence in SEQ ID NO:l 

r| encoding the Pgi polypeptide having the complete amino acid sequence in SEQ 

ID NO:2. 

The present invention further provides the above nucleic acid molecule 
wherein said polynucleotide has a nucleotide sequence encoding the Pgi 
25 polypeptide encoded by a DNA clone contained inNRRL Deposit No. B-301 74. 

The present invention further provides an isolated nucleic acid molecule 
comprising a polynucleotide which hybridizes under stringent hybridization 
conditions to a polynucleotide having a nucleotide sequence identical to a 
nucleotide sequence in (a), (b), or (c) of the above nucleic acid molecule, 
3 0 wherein said polynucleotide which hybridizes does not hybridize under stringent 
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hybridization conditions to a polynucleotide having a nucleotide sequence 
consisting of only A residues or of only T residues. 

The present invention further provides a method for making a 
recombinant vector comprising inserting the above isolated nucleic acid 
5 molecule into a vector. 

The present invention further provides a vector comprising the above 
nucleic acid molecule. The present invention further provides a method of 
making a recombinant host cell comprising introducing the above vector into a 
host cell. The present invention further provides a host cell comprising the 
1 0 above vector. The present invention further provides a method for producing a 

Pgi polypeptide, comprising culturing the above recombinant host cell under 
conditions such that said polypeptide is expressed and recovering said 
polypeptide. 

The present invention further provides an isolated polypeptide selected 
1 5 from the group consisting of: 

(a) a polypeptide comprising amino acids from about 1 to 
about 540 in SEQ ID NO:2; 

(b) a polypeptide comprising the amino acid sequence 
encoded by the DNA clone contained in NRRL Deposit 

20 No. B-30174; 

(c) a polypeptide variant created by altering the amino acid 
sequence of (a), wherein: 

(1) said altering includes an insertion, deletion, or 
substitution, or any combination thereof; and 
25 (2) the number of alterations is equal to or less than 5 

% of the total number of amino acids present in 

(a); 

(d) a polypeptide variant created by altering the 
polynucleotide of (b), wherein: 

30 (1) said altering includes an insertion, deletion, or 

substitution, or any combination thereof; and 
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(2) the number of alterations is equal to or less than 5 
% of the total number of amino acids present in 
(b). 

Nucleic Acid Molecules 

Unless otherwise indicated, all nucleotide sequences determined by 
sequencing a DNA molecule herein were determined using an automated DNA 
sequencer (such as the Model 373 from Applied Biosystems, Inc.), and all amino 
acid sequences of polypeptides encoded by DNA molecules determined herein 
were predicted by translation of a DNA sequence determined as above. 
Therefore, as is known in the art for any DNA sequence determined by this 
automated approach, any nucleotide sequence determined herein may contain 
some errors. Nucleotide sequences determined by automation are typically at 
least about 90% identical, more typically at least about 95% to at least about 
99.9% identical to the actual nucleotide sequence of the sequenced DNA 
molecule. As is also known in the art, a single insertion or deletion in a 
determined nucleotide sequence compared to the actual sequence will cause a 
frame shift in translation of the nucleotide sequence such that the predicted amino 
acid sequence encoded by a determined nucleotide sequence will be completely 
different from the amino acid sequence actually encoded by the sequenced DNA 
molecule, beginning at the point of such an insertion or deletion. 

Using the information provided herein, such as the nucleotide sequence 
in Figure 1, a nucleic acid molecule of the present invention encoding a Pgi 
polypeptide may be obtained using standard cloning and screening procedures. 

Thus, the present invention provides a nucleotide sequence encoding the 
Pgi polypeptide having the amino acid sequence encoded by the clone contained 
in the host identified as NRRL Deposit No. B-30174 and as shown in Figures 1 
(SEQ IDNOs:l and 2). 

As one of ordinary skill would appreciate, due to the possibilities of 
sequencing errors, the predicted Pgi polypeptide encoded by the deposited clone 
comprise about 540 amino acids, but may be anywhere in the range of 500 to 580 
amino acids. 
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As indicated, nucleic acid molecules of the present invention may be in 
the form of RNA, such as mRNA, or in the form of DNA, including, for instance, 
cDNA and genomic DNA obtained by cloning or produced synthetically. The 
DNA may be double-stranded or single-stranded. Single-stranded DNA or RNA 
may be the coding strand, also known as the sense strand, or it may be the 
non-coding strand, also referred to as the anti-sense strand. 

By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, 
DNA or RNA, which has been removed from its native environment. For 
example, recombinant DNA molecules contained in a vector are considered 
isolated for the purposes of the present invention. Further examples of isolated 
DNA molecules include recombinant DNA molecules maintained in heterologous 
host cells or purified (partially or substantially) DNA molecules in solution. 
Isolated RNA molecules include in vivo or in vitro RNA transcripts of the DNA 
molecules of the present invention. Isolated nucleic acid molecules according to 
the present invention further include such molecules produced synthetically. 

Isolated nucleic acid molecules of the present invention include DNA 
molecules comprising an open reading frame (ORF) shown in Figure 1 (SEQ ID 
NO:l); DNA molecules comprising the coding sequence for the Pgi protein 
shown in Figures 1 (SEQ ID NO:2); and DNA molecules which comprise a 
sequence substantially different from those described above but which, due to the 
degeneracy of the genetic code, still encode the Pgi protein. Of course, the 
genetic code is well known in the art. Thus, it would be routine for one skilled 
in the art to generate such degenerate variants. 

In addition, the invention provides nucleic acid molecules having 
nucleotide sequences related to extensive portions of SEQ ID NO: 1 . 

In another aspect, the invention provides isolated nucleic acid molecules 
encoding the Pgi polypeptide having an amino acid sequence encoded by the 
nucleic acid molecule deposited as NRRL Deposit No. B-30174 on 
August 17, 1999. The invention further provides an isolated nucleic acid 
molecule having the nucleotide sequence shown in Figure 1 (SEQ ID NO:l) or 
the nucleotide sequence of the pgi genomic sequence contained in the 
above-described deposited clone, or a nucleic acid molecule having a sequence 
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complementary to one of the above sequences. Such isolated molecules, 
particularly DNA molecules, are useful as probes for gene mapping, by in situ 
hybridization with chromosomes. 

The present invention is further directed to fragments of the isolated 
nucleic acid molecules described herein. By a fragment of an isolated nucleic 
acid molecule having the nucleotide sequence of the deposited clone or the 
nucleotide sequence shown in Figure 1 (SEQ ID NOs: 1) is intended fragments at 
least about 1 5 nucleotides (nt), and more preferably at least about 20 nt, still more 
preferably at least about 30 nt, and even more preferably, at least about 40 nt in 
length which are useful as diagnostic probes and primers as discussed herein. Of 
course, larger fragments 50, 75, 100, 125, 150, 175,200, 225,250,300,325,350, 
375, 400, 425, 450, 475, 500, 525, 550 nt in length are also useful according to 
the present invention as are fragments corresponding to most, if not all, of the 
nucleotide sequence of the deposited clone or as shown in Figure 1 (SEQ ID 
NOs: 1 ). By a fragment at least 20 nt in length, for example, is intended fragments 
which include 20 or more contiguous bases from the nucleotide sequence of the 
deposited clone or the nucleotide sequence as shown in Figure 1 (SEQ ID NO: 1). 

In another aspect, the invention provides an isolated nucleic acid molecule 
comprising a polynucleotide which hybridizes under stringent hybridization 
conditions to a portion of the polynucleotide in a nucleic acid molecule of the 
invention described above, for instance, the deposited clone contained in NRRL 
Deposit B-30174. By "stringent hybridization conditions" is intended overnight 
incubation at 42 °C in a solution comprising: 50% formamide, 5x SSC (150 mM 
NaCl, 15mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x 
Denhardt's solution, 10% dextran sulfate, and 20 g/ml denatured, sheared salmon 
sperm DNA, followed by washing the filters in 0.1 x SSC at about 65 °C. 

By a polynucleotide which hybridizes to a "portion" of a polynucleotide 
is intended a polynucleotide (either DNA or RNA) hybridizing to at least about 
1 5 nucleotides (nt), and more preferably at least about 20 nt, still more preferably 
at least about 30 nt, and even more preferably about 30-70 nt of the reference 
polynucleotide. These are useful as diagnostic probes and primers as discussed 
above and in more detail below. 
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By a portion of a polynucleotide of "at least 20 nt in length," for example, 
is intended 20 or more contiguous nucleotides from the nucleotide sequence of 
the reference polynucleotide (e.g., the deposited clone or the nucleotide sequence 
as shown in Figure 1 (SEQ ID NO:l). Of course, a polynucleotide which 
5 hybridizes only to a poly A sequence (such as the 3' terminal poly (A) tract of the 

pgi cDNA shown in Figure 1 (SEQ ID NO:l)), or to a complementary stretch of 
T (or U) residues, would not be included in a polynucleotide of the invention used 
to hybridize to a portion of a nucleic acid of the invention, since such a 
polynucleotide would hybridize to any nucleic acid molecule containing a poly 
10 (A) stretch or the complement thereof (e.g., practically any double-stranded 

cDNA clone). 

As indicated, nucleic acid molecules of the present invention which 
encode a Pgi polypeptide may include, but are not limited to those encoding the 

■ amino acid sequence of the polypeptide, by itself; the coding sequence for the 

^15 polypeptide and additional sequences, such as those encoding an amino acid 

leader or secretory sequence, such as a pre-, or pro- or prepro- protein sequence; 
the coding sequence of the polypeptide, with or without the aforementioned 
additional coding sequences, together with additional, non-coding sequences, 
including for example, but not limited to non-coding 5' and 3 ' sequences, such 

20 as the transcribed, non-translated sequences that play a role in transcription, 

mRNA processing, for example - ribosome binding and stability of mRN A; an 
additional coding sequence which codes for additional amino acids, such as those 
which provide additional functionalities. Thus, the sequence encoding the 
polypeptide may be fused to a marker sequence, such as a sequence encoding a 

25 peptide which facilitates purification of the fused polypeptide. In certain 

preferred embodiments of this aspect of the invention, the marker amino acid 
sequence is a hexa-histidine peptide, such as the tag provided in a pQE vector 
(Qiagen, Inc.), among others, many of which are commercially available. As 
described in Gentz et aL, Proc. Natl Acad. Set USA 55:821-824 (1989), for 

30 instance, hexa-histidine provides for convenient purification of the fusion protein. 

The "HA" tag is another peptide useful for purification which corresponds to an 
epitope derived from the influenza hemagglutinin protein, which has been 
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described by Wilson etal, Cell 37: 767 (1984). As discussed below, other such 
fusion proteins include the Pgi fused to Fc at the - or C-terminus. 

The probes, primers, and/or nucleic acid fragments described above can 
be used to monitor expression of the pgi gene during fermentation. 

The present invention further relates to variants of the nucleic acid 
molecules of the present invention, which encode portions, analogs or derivatives 
of the Pgi protein. Variants may occur naturally, such as a natural allelic variant. 
By an "allelic variant" is intended one of several alternate forms of a gene 
occupying a given locus on a chromosome of an organism. Genes II, Lewin, B., 
ed. , John Wiley & Sons, New York (1 985). Non-naturally occurring variants may 
be produced using art-known mutagenesis techniques. 

Such variants include those produced by nucleotide substitutions, 
deletions or additions, which may involve one or more nucleotides. The variants 
may be altered in coding regions, non-coding regions, or both. Alterations in the 
coding regions may produce conservative or non-conservative amino acid 
substitutions, deletions or additions. Especially preferred among these are silent 
substitutions, additions and deletions, which do not alter the properties and 
activities of the Pgi protein or portions thereof. Also especially preferred in this 
regard are conservative substitutions. 

Further embodiments of the invention include isolated nucleic acid 
molecules comprising a polynucleotide having a nucleotide sequence at least 
95%, 96%, 97%, 98% or 99% identical to (a) a nucleotide sequence encoding the 
polypeptide having the amino acid sequence in SEQ ID NO:2; (b) a nucleotide 
sequence encoding the full-length Pgi polypeptide having the complete amino 
acid sequence encoded by the clone contained in NRRL Deposit No. B-30174; 
or (c) a nucleotide sequence complementary to any of the nucleotide sequences 
in (a) or (b). 

By a polynucleotide having a nucleotide sequence at least, for example, 
95% "identical" to a reference nucleotide sequence encoding a Pgi polypeptide 
is intended that the nucleotide sequence of the polynucleotide is identical to the 
reference sequence except that the polynucleotide sequence may include up to 
five point mutations per each 100 nucleotides of the reference nucleotide 
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sequence encoding the Pgi polypeptide. In other words, to obtain a 
polynucleotide having a nucleotide sequence at least 95% identical to a reference 
nucleotide sequence, up to 5% of the nucleotides in the reference sequence may 
be deleted or substituted with another nucleotide, or a number of nucleotides up 
to 5% of the total nucleotides in the reference sequence may be inserted into the 
reference sequence. These mutations of the reference sequence may occur at the 
5' or 3 ' terminal positions of the reference nucleotide sequence or anywhere 
between those terminal positions, interspersed either individually among 
nucleotides in the reference sequence or in one or more contiguous groups within 
the reference sequence. 

As a practical matter, whether any particular nucleic acid molecule is at 
least 95%, 96%, 97%, 98% or 99% identical to, for instance, the nucleotide 
sequence shown in Figure 1 or to the nucleotides sequence of the deposited clones 
can be determined conventionally using known computer programs such as the 
Bestfit program (Wisconsin Sequence Analysis Package, Version 8 for Unix, 
Genetics Computer Group, University Research Park, 575 Science Drive, 
Madison, WI 53711. Bestfit uses the local homology algorithm of Smith and 
Waterman, Advances in Applied Mathematics 2: 482-489 (1 98 1 ), to find the best 
segment of homology between two sequences. When using Bestfit or any other 
sequence alignment program to determine whether a particular sequence is, for 
instance, 95% identical to a reference sequence according to the present 
invention, the parameters are set, of course, such that the percentage of identity 
is calculated over the full length of the reference nucleotide sequence and that 
gaps in homology of up to 5% of the total number of nucleotides in the reference 
sequence are allowed. 

The present application is directed to nucleic acid molecules at least 95%, 
96%, 97%, 98% or 99% identical to the nucleic acid sequence shown in Figure 
1 (SEQ ID NO:l) or to the nucleic acid sequence of the deposited clone, 
irrespective of whether they encode a polypeptide having Pgi activity. This is 
because even where a particular nucleic acid molecule does not encode a 
polypeptide having Pgi activity, one of skill in the art would still know how to use 
the nucleic acid molecule, for instance, as a hybridization probe or a polymerase 
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chain reaction (PCR) primer. Uses of the nucleic acid molecules of the present 
invention that do not encode a polypeptide having Pgi activity include, inter alia, 
isolating the pgi gene or allelic variants thereof in a genomic library and Northern 
Blot analysis for detecting pgi mRNA expression. 
5 Preferred, however, are nucleic acid molecules having sequences at least 

95%, 96%, 97%, 98% or 99% identical to the nucleic acid sequence shown in 
Figure 1 (SEQ ID NO:l) or to the nucleic acid sequence of the deposited clone 
which do, in fact, encode a polypeptide having Pgi protein activity. By "a 
polypeptide having Pgi activity" is intended polypeptides exhibiting activity 
10 similar, but not necessarily identical, to an activity of the Pgi protein of the 

invention, as measured in a particular biological assay. 

Of course, due to the degeneracy of the genetic code, one of ordinary skill 
O in the art will immediately recognize that a large number of the nucleic acid 

jtj molecules having a sequence at least 95%, 96%, 97%, 98%, or 99% identical to 

! L ~15 the nucleic acid sequence of the deposited clone or the nucleic acid sequence 

4: shown in Figure 1 (SEQ ID NO: 1) will encode a polypeptide "having Pgi protein 

JIJ activity." In fact, since degenerate variants of these nucleotide sequences all 

^ encode the same polypeptide, this will be clear to the skilled artisan even without 

Sj performing the above described comparison assay. It will be further recognized 

1720 in the art that, for such nucleic acid molecules that are not degenerate variants, a 

y reasonable number will also encode a polypeptide having Pgi protein activity. 

This is because the skilled artisan is fully aware of amino acid substitutions that 
are either less likely or not likely to significantly effect protein function (e.g., 
replacing one aliphatic amino acid with a second aliphatic amino acid). 
25 For example, guidance concerning how to make phenotypically silent 

amino acid substitutions is provided in Bowie, J. U. et al , "Deciphering the 
Message in Protein Sequences: Tolerance to Amino Acid Substitutions," 
1:1306-1310 (1990), wherein the authors indicate that proteins are surprisingly 
tolerant of amino acid substitutions. 
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Vectors and Host Cells 

The present invention also relates to vectors which include the isolated 
DNA molecules of the present invention, host cells which are genetically 
engineered with the recombinant vectors, and the production of Pgi polypeptides 
or fragments thereof by recombinant techniques. 

The polynucleotides may be joined to a vector containing a selectable 
marker for propagation in a host. If the vector is a virus, it may be packaged in 
vitro using an appropriate packaging cell line and then transduced into host cells. 

The DNA insert should be operatively linked to an appropriate promoter, 
such as the phage lambda PL promoter, the E. coli lac, trp and tac promoters to 
name a few. Other suitable promoters will be known to the skilled artisan. The 
expression constructs will further contain sites for transcription initiation, 
termination and, in the transcribed region, a ribosome binding site for translation. 
The coding portion of the mature transcripts expressed by the constructs will 
preferably include a translation initiating at the beginning and a termination 
codon (UAA, UGA or UAG) appropriately positioned at the end of the 
polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include but are not limited to kanamycin 
chloramphenicol, tetracycline or ampicillin resistance genes for culturing in E. 
coli and other bacteria. Representative examples of appropriate hosts include, but 
are not limited to, bacterial cells, such as E. coli, Streptomyces and Salmonella 
typhimurium cells; fungal cells, such as yeast cells; insect cells such as 
Drosophila S2 and Spodoptera Sf9 cells. Appropriate culture mediums and 
conditions for the above-described host cells are known in the art. 

Among known bacterial promoters suitable for use in the production of 
proteins of the present invention include the E, coli lad and lacZ promoters, the 
T3 and T7 promoters, the gpt promoter, the lambda PR and PL promoters and the 
trp promoter. 

Thus, the present invention is also directed to expression vector useful for 
the production of the proteins of the present invention. 
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Among vectors preferred for use in bacteria include pQE70, pQE60 and 
pQE-9, available from Qiagen; pBS vectors, Phagescript vectors, Bluescript 
vectors, pNH8A, pNH16a, pNH18A, pNH46A, available from Stratagene; and 
ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia. 
Other suitable vectors will be readily apparent to the skilled artisan. 

Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic 
lipid-mediated transfection, electroporation, transduction, infection or other 
methods. Such methods are described in many standard laboratory manuals, such 
as Davis et aL, Basic Methods In Molecular Biology (1986). 

The polypeptide may be expressed in a modified form, such as a fusion 
protein, and may include not only secretion signals, but also additional 
heterologous functional regions. For instance, a region of additional amino acids, 
particularly charged amino acids, may be added to the N-terminus of the 
polypeptide to improve stability and persistence in the host cell, during 
purification, or during subsequent handling and storage. Also, peptide moieties 
may be added to the polypeptide to facilitate purification. Such regions may be 
removed prior to final preparation of the polypeptide. The addition of peptide 
moieties to polypeptides to engender secretion or excretion, to improve stability 
and to facilitate purification, among others, are familiar and routine techniques 
in the art. 

The Pgi protein can be recovered and purified from recombinant cell 
cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, 
affinity chromatography, hydroxylapatite chromatography and lectin 
chromatography. Most preferably, high performance liquid chromatography 
("HPLC") is employed for purification. Polypeptides of the present invention 
include naturally purified products, products of chemical synthetic procedures, 
and products produced by recombinant techniques from a prokaryotic or 
eukaryotic host, including, for example, bacterial, yeast, higher plant, insect and 
mammalian cells. Depending upon the host employed in a recombinant 
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production procedure, the polypeptides of the present invention may be 
glycosylated or may be non-glycosylated. In addition, polypeptides of the 
invention may also include an initial modified methionine residue, in some cases 
as a result of host-mediated processes. 



5 Pgi Polypeptides and Fragments 

The invention further provides an isolated Pgi polypeptide having the 
amino acid sequence encoded by the deposited clone, or the amino acid sequence 
in Figure 1 (SEQ ID NO: 2), or a peptide or polypeptide comprising a portion of 
the above polypeptides. 
10 It will be recognized in the art that some amino acid sequences of the Pgi 

polypeptide can be varied without significant effect of the structure or function 
p of the protein. If such differences in sequence are contemplated, it should be 

j~ remembered that there will be critical areas on the protein which determine 

activity. 

[gzl 5 Thus, the invention further includes variations of the Pgi polypeptide 

JIJ which show substantial Pgi polypeptide activity or which include regions of Pgi 

^ protein such as the protein portions discussed below. Such mutants include 

%\ deletions, insertions, inversions, repeats, and type substitutions. As indicated 

I*f above, guidance concerning which amino acid changes are likely to be 

J^20 phenotypically silent can be found in Bowie, J.U., et al, "Deciphering the 

Message in Protein Sequences: Tolerance to Amino Acid Substitutions," Science 
247:1306-1310(1990). 

Thus, the fragment, derivative or analog of the polypeptide of Figure 1 
(SEQ ID NO: 2), or the Pgi polypeptide encoded by the deposited clone, may be 
25 (i) one in which one or more of the amino acid residues are substituted with a 

conserved or non-conserved amino acid residue (preferably a conserved amino 
acid residue) and such substituted amino acid residue may or may not be one 
encoded by the genetic code, or (ii) one in which one or more of the amino acid 
residues includes a substituent group, or (iii) one in which the polypeptide is 
30 fused with another compound, such as a compound to increase the half-life of the 

. polypeptide (for example, polyethylene glycol), or (iv) one in which the additional 
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amino acids are fused to the mature polypeptide, such as an IgG Fc fusion region 
peptide or leader or secretory sequence or a sequence which is employed for 
purification of the polypeptide or a proprotein sequence. Such fragments, 
derivatives and analogs are deemed to be within the scope of those skilled in the 
art from the teachings herein. 

As indicated, changes are preferably of a minor nature, such as 
conservative amino acid substitutions that do not significantly affect the folding 
or activity of the protein (see Table 1). 



TABLE 1 . Conservative Amino Acid Substitutions. 



Aromatic 


Phenylalanine 




Tryptophan 




Tyrosine 


Hydrophobic 


Leucine 




Isoleucine 




Valine 


Polar 


Glutamine 




Asparagine 


Basic 


Arginine 




Lysine 




Histidine 


Acidic 


Aspartic Acid 




Glutamic Acid 


Small 


Alanine 




Serine 




Threonine 




Methionine 




Glycine 



Of course, the number of amino acid substitutions a skilled artisan would 
make depends on many factors, including those described above. Generally 
speaking, the number of amino acid substitutions for any given Pgi polypeptide 
will not be more than 50, 40, 30, 20, 10, 5, or 3. 

Amino acids in the Pgi protein of the present invention that are essential 
for function can be identified by methods known in the art, such as site-directed 
mutagenesis or alanine-scanning mutagenesis (Cunningham and Wells, Science 
244: 1081-1085 (1989)). The latter procedure introduces single alanine mutations 
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at every residue in the molecule. The resulting mutant molecules are then tested 
for phosphoglucose isomerase activity. 

The polypeptides of the present invention are preferably provided in an 
isolated form, and preferably are substantially purified. By "isolated polypeptide" 
is intended a polypeptide removed from its native environment. Thus, a 
polypeptide produced and/or contained within a recombinant host cell is 
considered isolated for purposes of the present invention. Also intended as an 
"isolated polypeptide" are polypeptides that have been purified, partially or 
substantially, from a recombinant host cell. For example, a recombinantly 
produced version of the Pgi polypeptide can be substantially purified by the 
one-step method described in Smith and Johnson, Gene 67:31-40 (1988). 

The polypeptides of the present invention include the Pgi polypeptide 
encoded by the deposited DNA and polypeptides which are at least 95% identical, 
still more preferably at least 96%, 97%, 98% or 99% identical to the polypeptides 
encoded by the deposited clone, to the polypeptide of Figure 1 (SEQ ID NO:2), 
and also include portions of such polypeptides with at least 30 amino acids and 
more preferably at least 50 amino acids. 

By a polypeptide having an amino acid sequence at least, for example, 
95% "identical" to a reference amino acid sequence of a Pgi polypeptide is 
intended that the amino acid sequence of the polypeptide is identical to the 
reference sequence except that the polypeptide sequence may include up to five 
amino acid alterations per each 100 amino acids of the reference amino acid of 
the Pgi polypeptide. In other words, to obtain a polypeptide having an amino acid 
sequence at least 95% identical to a reference amino acid sequence, up to 5% of 
the amino acid residues in the reference sequence may be deleted or substituted 
with another amino acid, or a number of amino acids up to 5% of the total amino 
acid residues in the reference sequence may be inserted into the reference 
sequence. These alterations of the reference sequence may occur at the amino or 
carboxy terminal positions of the reference amino acid sequence or anywhere 
between those terminal positions, interspersed either individually among residues 
in the reference sequence or in one or more contiguous groups within the 
reference sequence. 
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As a practical matter, whether any particular polypeptide is at least 95%, 
96%, 97%, 98% or 99% identical to, for instance, the amino acid sequence shown 
in Figures 1 (SEQ ID NO: 2) or to the amino acid sequence encoded by deposited 
clones can be determined conventionally using known computer programs such 
the Bestfit program (Wisconsin Sequence Analysis Package, Version 8 for Unix, 
Genetics Computer Group, University Research Park, 575 Science Drive, 
Madison, WI 53711). When using Bestfit or any other sequence alignment 
program to determine whether a particular sequence is, for instance, 95% 
identical to a reference sequence according to the present invention, the 
parameters are set, of course, such that the percentage of identity is calculated 
over the full length of the reference amino acid sequence and that gaps in 
homology of up to 5% of the total number of amino acid residues in the reference 
sequence are allowed. 

The polypeptide of the present invention could be used as a molecular 
weight marker on SDS-PAGE gels or on molecular sieve gel filtration columns 
using methods well known to those of skill in the art. 

N-terminal and C-terminal Deletion Mutants 

In one embodiment, the present invention provides polypeptides having 
one or more residues deleted from the amino terminus of the amino acid sequence 
of the Pgi polypeptide depicted in Figure 1 or encoded by the DNA of the 
deposited clone. Particularly, in one embodiment, N-terminal deletions of the Pgi 
polypeptide can be described by the general formula m to 540, where m is any 
one of the integers from 2 to 539 corresponding to the position of the amino acid 
residue identified in SEQ ID NO:2 and, preferably, corresponds to one of the 
N-terminal amino acid residues identified in the N-terminal deletions specified 
herein. Polynucleotides encoding these polypeptides are also encompassed by the 
invention. 

Further embodiments of the invention are directed to C-terminal deletions 
of the Pgi polypeptides of the invention, described by the general formula 1 to n, 
where n is any one of the integers from 2 to 539 corresponding to the position of 
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amino acid residue identified in SEQ ID NO: 2, and preferably corresponds to a 
residue identified in one of the C-terminal deletions specified herein. 
Polynucleotides encoding these polypeptides are also encompassed by the 
invention. 

Further embodiments of the invention are directed to polypeptide 
fragments comprising, or alternatively, consisting of, amino acid residues 
described by the general formula m to n, where m and n correspond to any one 
of the amino acid residues specified above for these symbols, respectively. 
Polynucleotides encoding these polypeptides are also encompassed by the 
invention. 

The following examples are illustrative only and are not intended to limit 
the scope of the invention as defined by the appended claims. It will be apparent 
to those skilled in the art that various modifications and variations can be made 
in the methods of the present invention without departing from the spirit and 
scope of the invention. Thus, it is intended that the present invention cover the 
modifications and variations of this invention provided they come within the 
scope of the appended claims and their equivalents. 

Examples 

Example 1 - DNA Isolation and Purification 

DNA was isolated from cultures of NRRL B- 1 1 474 cells. NRRL B- 1 1 474 
cells were harvested from CM media (Table B) and suspended in 10 ml of TE pH 
8 (10 mM Tris*Cl, 1 mM EDTA). Forty micrograms of RNase A and 10 
milligrams of lysozyme were added per milliliter of suspension and the 
suspension was incubated at 37°C for 30 minutes. The suspension was made 
1 .0% in sodium dodecyl sulfate (SDS) and 0. 1 mg/1 proteinase K was added, and 
the cells were lysed by incubation at 37°C for 10 minutes. Nucleic acids were 
purified by three extractions with TE-saturated phenol (pH 7), followed by 
ethanol precipitation. Nucleic acid precipitates were twice washed with 80% 
ethanol and redissolved in TE pH 8. The concentrations of DNA were quantified 



28 



spectrophotometrically at 260 nm. Purity of DNA preparations were determined 
spectrophotometrically (A260/A280 and A260/A230 ratios) and by agarose gel 
electrophoresis (0.8% agarose in IxTAE). 

Sequencing of genomic DNA was performed, as is known by one of 
ordinary skill in the art, by creating libraries of plasmids and cosmids using 
pGEM3 and Lorist 6, respectively. The C. glutamicum pgi gene was identified 
by homology to glucose-6-phosphate isomerase of Mycobacterium tuberculosis 
(Swiss Prot Accession number P77895, Swiss Prot ID G6PIJVIYCTU). 

Example 2 - Increasing NADPH Availability by Disrupting pgi 

An increase in carbon flux through the oxidative branch of the pentose 
phosphate pathway was achieved by disrupting the pgi gene which encodes 6- 
phosphoglucose isomerase. Two PCR primers were designed from the genomic 
DNA sequence described above to facilitate the amplification of a 680 bp internal 
region of the C glutamicum pgi gene. These primers were: 

pgif* (SEQ ID NO:3) 5' gctgatgtccacgaagctttgggac 3' 

pgir* (SEQ ID NO: 4) 3' gctgagaaccttggaataaggtagg 3' 

Primers pgif* and pgir* contain the recognition sequence for the 
restriction enzyme Hind III. In the case of pgir*, it was necessary to make three 
changes from the C. glutamicum nucleotide sequence to incorporate the Hind III 
recognition sequence. These Hind III restriction sites facilitated subcloning. 

PCR amplification conditions were employed as follows. The final 
volume of each PCR reaction was 1 00 ul. 1 00 ng of each primer was used along 
with 50 ng of high molecular weight C. glutamicum ATCC 2 1 799 genomic DNA 
and 2.5 units of Taq DNA polymerase. Reaction buffer was included at a 
concentration recommended by the manufacturer (Stratagene) and dNTPs were 
also included at a final concentration of 200 uM. Cycling parameters were as 
follows: 94 °C for 1 minute, followed by 94 °C for 30 seconds, 60 °C for 30 
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seconds and 72 °C for 1 minute (30 cycles), 72 °C for 7 minutes followed by 
refrigeration. 

On restriction with Hind III, the PCR product was reduced in size to 
approximately 660 bp. This fragment was then subcloned into the suicide vector 
pBGS 13 1 . The resulting subclone was designated pDPTpgi2. 

Following electrotransformation into competent C. glutamicum (NRRL 
B 11474) cells, integrants were selected for on CM (Table B) agar plates 
containing kanamycin at a final concentration of 10 |ig/ml. Enzyme assay 
confirmed the absence of phosphoglucose isomerase activity in the mutant strains, 
indicating that the pgi gene in these strains had been disrupted. 

Shake flask experiments indicate that the C. glutamicum (NRRL B 1 1 474) 
pgi mutants have improved lysine titers and yields when compared to C. 
glutamicum (NRRL Bl 1474) (Table A). 

Table A: Lysine production on FM3 (Table C) media 

Strain Growth Titer Yield 

NRRL B 11474 46 25 42 

NRRLB11474::pgi2A 40 31 52 

Growth = optical density at 660 nm 

Titer = grams of lysine / liter of medium 

Yield = (grams of lysine / grams of glucose consumed)* 1 00 
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Table B: 

Volume: 



CM Media 

1000 ml 



Sucrose 
KH2 P04 
K2HP04 
Urea 

MgS04 * 7H20 

Polypeptone 

Beef Extract 

Biotin 

Thiamine 

Niacinamide 

L-Methionine 

L-Threonine 

L-Alanine 

Bring to volume 

pH - about 7.1 



% Agar: 



0 



50 g 
0.5 g 
1.5 g 
3g 
0.5 g 
20 g 
5g 

12.5 ml (60mg/L) 

25 ml(120mg/L) 

25 ml (5g/L) 
0.5 g 
0.25 g 
0.5 g 

1000 mis with DI water. 



11120 
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Table C: FM3 Media 

Per liter 

(NH 4 ) 2 S0 4 

KH 2 P0 4 

MgS0 4 *7H 2 0 

MnS0 4 *H 2 0 

FeS0 4 *7H 2 0 

Biotin 

Corn Steep Liquor 

Glucose 

CaC0 3 



50g 

lg 

0.4g 

O.Olg 

O.Olg 

0.03mg 

4% dry solids final concentration 

6% final concentration 

50g 
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Example 3 - Disrupting the Gene Encoding 6-Phosphofructokinase (pfkA) 

The gene encoding for 6-phosphofructokinase (pfkA) was disrupted in a 
method similar to that described for the pgi gene in Example 2. Disruption of the 
pfkA gene was verified by enzyme assay of extracts of the mutants and showed 
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that 6-phosphofructokinase activity was lacking. Unexpectedly, the C. 
glutamicum (NRRL B-l 1474) pfkA mutants were unable to utilize glucose. 

***** 

All patents and publications referred to herein are hereby expressly 
incorporated in their entirety by reference. 

While the foregoing invention has been described in some detail for 
purposes of clarity and understanding, it will be appreciated by one skilled in the 
art from a reading of this disclosure that various changes in form and detail can 
be made without departing from the true scope of the invention and appended 
claims. 
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